A robust environment-effects suppression training algorithm for adverse Mandarin speech recognition
نویسندگان
چکیده
In this paper, a new robust training algorithm for the generation of a set of bias-removed, noise-suppressed reference speech HMM models directly from a training database collected in adverse environment suffering with both convolutional channel bias and additive noise is proposed. Its main idea is to incorporate a signal biascompensation operation and a PMC noise-compensation operation into its iterative training process in order to make the resulting speech HMM models more suitable to the given robust speech recognition method using the same signal bias-compensation and PMC noisecompensation operations in the recognition process. Experimental results showed that the speech HMM models it generated outperformed both the clean-speech HMM models and those generated by the conventional k-means algorithm for two adverse Mandarin speech recognition tasks. So it is a promising robust training algorithm.
منابع مشابه
A robust training algorithm for adverse speech recognition
In this paper, a new robust training algorithm is proposed for the generation of a set of bias-removed, noise-suppressed reference speech HMM models in adverse environment suering from both channel bias and additive noise. Its main idea is to incorporate a signal bias-compensation operation and a PMC noise-compensation operation into its iterative training process. This makes the resulting spe...
متن کاملRobust automatic speech recognition for accented Mandarin in car environments
This paper addresses the issues of robust automatic speech recognition (ASR) for accented Mandarin in car environments. A robust front-end is proposed, which adopts a Minimum Mean-Square Error (MMSE) estimator to suppress the background noise in frequency domain, and then implements spectrum smoothing both in time and frequency index to compensate those spectrum components distorted by the nois...
متن کاملOn-line Mandarin Phonetic Symbol Recognition for Video-based Fingertip Input System, " Revised in Journal of Visual Communication and Image Representation
[1] Wei-Tyng Hong, “Hidden Conditional Random Fields for Resource-constrained Speech Recognition”, Advanced Science Letters. (accepted, 2011) (EI, SCI) [2] Wei-Tyng Hong, “An Investigation on Robust Confidence Measure and Model Compensations for Smartphone-based Speech Recognition”, International Journal of Advanced Information Technologies. (accepted, 2011) [3] Wei-Tyng Hong, “Text-independent...
متن کاملNoise Suppression Based on Teager Energy Operator for Improving the Robustness of Asr Front-end
In this paper, we proposed a new noise suppression method based on Teager Energy Operator in advancing the noise robustness of speech recognition front-end. The presented method attempts to remove a distortion estimation in Teager energy domain, especially, a Teager energy estimation of noise signal is subtracted from the noisy speech signal. This approach differs significantly from the traditi...
متن کاملImproving the performance of MFCC for Persian robust speech recognition
The Mel Frequency cepstral coefficients are the most widely used feature in speech recognition but they are very sensitive to noise. In this paper to achieve a satisfactorily performance in Automatic Speech Recognition (ASR) applications we introduce a noise robust new set of MFCC vector estimated through following steps. First, spectral mean normalization is a pre-processing which applies to t...
متن کامل